Search CORE

8 research outputs found

From Physics Model to Results: An Optimizing Framework for Cross-Architecture Code Generation

Author: Blazewicz Marek
Brandt Steven R.
Ciznicki Milosz
Hinder Ian
Kierzynka Michal
Koppelman David M.
Löffler Frank
Schnetter Erik
Tao Jian
Publication venue: 'IOS Press'
Publication date: 01/01/2013
Field of study

Starting from a high-level problem description in terms of partial differential equations using abstract tensor notation, the Chemora framework discretizes, optimizes, and generates complete high performance codes for a wide range of compute architectures. Chemora extends the capabilities of Cactus, facilitating the usage of large-scale CPU/GPU systems in an efficient manner for complex applications, without low-level code tuning. Chemora achieves parallelism through MPI and multi-threading, combining OpenMP and CUDA. Optimizations include high-level code transformations, efficient loop traversal strategies, dynamically selected data and instruction cache usage strategies, and JIT compilation of GPU code tailored to the problem characteristics. The discretization is based on higher-order finite differences on multi-block domains. Chemora's capabilities are demonstrated by simulations of black hole collisions. This problem provides an acid test of the framework, as the Einstein equations contain hundreds of variables and thousands of terms.Comment: 18 pages, 4 figures, accepted for publication in Scientific Programmin

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

Louisiana State University

MPG.PuRe

Energy aware scheduling model and online heuristics for stencil codes on heterogeneous computing architectures

Author: AA Chandio
AD Pereira
CE Shannon
G Terzopoulos
I Holyer
IM Bomze
J Mei
J Treibig
Jan Weglarz
K Bilal
K Datta
K Kurowski
KA Rojek
Krzysztof Kurowski
M Blazewicz
M Ciznicki
M Ciznicki
M Ciznicki
M Ciznicki
Milosz Ciznicki
S Sellappa
S Williams
VG Vizing
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The ESCAPE project : Energy-efficient Scalable Algorithms for Weather Prediction at Exascale

Author: Baldauf Michael
Bauer Peter
Berg Per
Bosak Bartosz
Bénard Pierre
Błażewicz Marek
Ciesielski Sebastian
Ciznicki Milosz
Clement Valentin
Colavolpe Charles
Deconinck Willem
Degrauwe Daan
Diamantakis Michail
Douriez Louis
Fuhrer Oliver
Gillard Mike
Glinton Michael
Gray Alan
Guibert David
Hamrud Mats
Kulczewski Michał
Kurowski Krzysztof
Kühnlein Christian
Lange Michael
Lock Sarah-Jane
Lysaght Michael
Macfaden Alexander J
Marguinaud Philippe
Mazauric Cyril
McKinstry Alastair
Mengaldo Gianmarco
Messmer Peter
Mozdzynski George
Müller Andreas
New Nick
Nielsen Kristian P
O'Brien Enda
Osuna Carlos
Piotrowski Zbigniew P
Piątek Wojciech
Poulsen Jacob W
Procyk Marcin
Raffin Erwan
Robinson Oisín
Saarinen Sami
Sass Bent H
Shukla Parijat
Smet Geert
Smolarkiewicz Piotr K
Spychala Pawel
Szmelter Joanna
Termonia Piet
Thiemert Daniel
Van Bever Joris
Vigouroux Xavier
Voitus Fabrice
Wedi Nils
Wyszogrodzki Andrzej
Zheng Yongjun
Publication venue: 'Copernicus GmbH'
Publication date: 01/01/2019
Field of study

In the simulation of complex multi-scale flows arising in weather and climate modelling, one of the biggest challenges is to satisfy strict service requirements in terms of time to solution and to satisfy budgetary constraints in terms of energy to solution, without compromising the accuracy and stability of the application. These simulations require algorithms that minimise the energy footprint along with the time required to produce a solution, maintain the physically required level of accuracy, are numerically stable, and are resilient in case of hardware failure. The European Centre for Medium-Range Weather Forecasts (ECMWF) led the ESCAPE (Energy-efficient Scalable Algorithms for Weather Prediction at Exascale) project, funded by Horizon 2020 (H2020) under the FET-HPC (Future and Emerging Technologies in High Performance Computing) initiative. The goal of ESCAPE was to develop a sustainable strategy to evolve weather and climate prediction models to next-generation computing technologies. The project partners incorporate the expertise of leading European regional forecasting consortia, university research, experienced high-performance computing centres, and hardware vendors. This paper presents an overview of the ESCAPE strategy: (i) identify domain-specific key algorithmic motifs in weather prediction and climate models (which we term Weather & Climate Dwarfs), (ii) categorise them in terms of computational and communication patterns while (iii) adapting them to different hardware architectures with alternative programming models, (iv) analyse the challenges in optimising, and (v) find alternative algorithms for the same scheme. The participating weather prediction models are the following: IFS (Integrated Forecasting System); ALARO, a combination of AROME (Application de la Recherche a l'Operationnel a Meso-Echelle) and ALADIN (Aire Limitee Adaptation Dynamique Developpement International); and COSMO-EULAG, a combination of COSMO (Consortium for Small-scale Modeling) and EULAG (Eulerian and semi-Lagrangian fluid solver). For many of the weather and climate dwarfs ESCAPE provides prototype implementations on different hardware architectures (mainly Intel Skylake CPUs, NVIDIA GPUs, Intel Xeon Phi, Optalysys optical processor) with different programming models. The spectral transform dwarf represents a detailed example of the co-design cycle of an ESCAPE dwarf. The dwarf concept has proven to be extremely useful for the rapid prototyping of alternative algorithms and their interaction with hardware; e.g. the use of a domain-specific language (DSL). Manual adaptations have led to substantial accelerations of key algorithms in numerical weather prediction (NWP) but are not a general recipe for the performance portability of complex NWP models. Existing DSLs are found to require further evolution but are promising tools for achieving the latter. Measurements of energy and time to solution suggest that a future focus needs to be on exploiting the simultaneous use of all available resources in hybrid CPU-GPU arrangements

Loughborough University Institutional Repository

Ghent University Academic Bibliography

The ESCAPE project: Energy-efficient Scalable Algorithms for Weather Prediction at Exascale

Author: Alan Gray (3276414)
Alastair McKinstry (7533326)
Alexander J Macfaden (7533380)
Andreas Müller (352087)
Andrzej Wyszogrodzki (7533362)
Bartosz Bosak (7533356)
Bent H Sass (7533305)
Carlos Osuna (5747222)
Charles Colavolpe (7533284)
Christian Kühnlein (7533260)
Cyril Mazauric (7533368)
Daan Degrauwe (7533293)
Daniel Thiemert (7533272)
David Guibert (7533371)
Enda O'Brien (7533323)
Erwan Raffin (7533365)
Fabrice Voitus (7533281)
Geert Smet (7533296)
George Mozdzynski (7213583)
Gianmarco Mengaldo (7533263)
Jacob W Poulsen (7533308)
Joanna Szmelter (1250304)
Joris Van Bever (7533290)
Kristian P Nielsen (7533302)
Krzysztof Kurowski (372442)
Louis Douriez (7533374)
Marcin Procyk (7533353)
Marek Błażewicz (7533350)
Mats Hamrud (7213580)
Michael Baldauf (7533320)
Michael Glinton (7533275)
Michael Lange (3094497)
Michael Lysaght (7533335)
Michail Diamantakis (5774088)
Michał Kulczewski (7533338)
Mike Gillard (1260393)
Milosz Ciznicki (7533341)
Nick New (7533383)
Nils Wedi (5774006)
Oisín Robinson (7533329)
Oliver Fuhrer (7533314)
Parijat Shukla (7533332)
Pawel Spychala (746616)
Per Berg (7533311)
Peter Bauer (524851)
Peter Messmer (7533377)
Philippe Marguinaud (7533287)
Pierre Bénard (7533278)
Piet Termonia (7533299)
Piotr K Smolarkiewicz (7533266)
Sami Saarinen (7533269)
Sarah-Jane Lock (5774078)
Sebastian Ciesielski (7533347)
Valentin Clement (7533317)
Willem Deconinck (7213469)
Wojciech Piątek (7533344)
Xavier Vigouroux (5547218)
Yongjun Zheng (1660624)
Zbigniew P Piotrowski (7533359)
Publication venue
Publication date: 22/10/2019
Field of study

Abstract. In the simulation of complex multi-scale flows arising in weather and climate modelling, one of the biggest challenges is to satisfy strict service requirements in terms of time to solution and to satisfy budgetary constraints in terms of energy to solution, without compromising the accuracy and stability of the application. These simulations require algorithms that minimise the energy footprint along with the time required to produce a solution, maintain the physically required level of accuracy, are numerically stable, and are resilient in case of hardware failure. The European Centre for Medium-Range Weather Forecasts (ECMWF) led the ESCAPE (Energy-efficient Scalable Algorithms for Weather Prediction at Exascale) project, funded by Horizon 2020 (H2020) under the FET-HPC (Future and Emerging Technologies in High Performance Computing) initiative. The goal of ESCAPE was to develop a sustainable strategy to evolve weather and climate prediction models to next-generation computing technologies. The project partners incorporate the expertise of leading European regional forecasting consortia, university research, experienced high-performance computing centres, and hardware vendors. This paper presents an overview of the ESCAPE strategy: (i) identify domain-specific key algorithmic motifs in weather prediction and climate models (which we term Weather & Climate Dwarfs), (ii) categorise them in terms of computational and communication patterns while (iii) adapting them to different hardware architectures with alternative programming models, (iv) analyse the challenges in optimising, and (v) find alternative algorithms for the same scheme. The participating weather prediction models are the following: IFS (Integrated Forecasting System); ALARO, a combination of AROME (Application de la Recherche à l'Opérationnel à Meso-Echelle) and ALADIN (Aire Limitée Adaptation Dynamique Développement International); and COSMO–EULAG, a combination of COSMO (Consortium for Small-scale Modeling) and EULAG (Eulerian and semi-Lagrangian fluid solver). For many of the weather and climate dwarfs ESCAPE provides prototype implementations on different hardware architectures (mainly Intel Skylake CPUs, NVIDIA GPUs, Intel Xeon Phi, Optalysys optical processor) with different programming models. The spectral transform dwarf represents a detailed example of the co-design cycle of an ESCAPE dwarf. The dwarf concept has proven to be extremely useful for the rapid prototyping of alternative algorithms and their interaction with hardware; e.g. the use of a domain-specific language (DSL). Manual adaptations have led to substantial accelerations of key algorithms in numerical weather prediction (NWP) but are not a general recipe for the performance portability of complex NWP models. Existing DSLs are found to require further evolution but are promising tools for achieving the latter. Measurements of energy and time to solution suggest that a future focus needs to be on exploiting the simultaneous use of all available resources in hybrid CPU–GPU arrangements

Loughborough University Institutional Repository

Evaluation of Selected Resource Allocation and Scheduling Methods in Heterogeneous Many-Core Processors and Graphics Processing Units

Author: Ciznicki Milosz
Kurowski Krzysztof
Węglarz Jan
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/12/2014
Field of study

Heterogeneous many-core computing resources are increasingly popular among users due to their improved performance over homogeneous systems. Many developers have realized that heterogeneous systems, e.g. a combination of a shared memory multi-core CPU machine with massively parallel Graphics Processing Units (GPUs), can provide significant performance opportunities to a wide range of applications. However, the best overall performance can only be achieved if application tasks are efficiently assigned to different types of processor units in time taking into account their specific resource requirements. Additionally, one should note that available heterogeneous resources have been designed as general purpose units, however, with many built-in features accelerating specific application operations. In other words, the same algorithm or application functionality can be implemented as a different task for CPU or GPU. Nevertheless, from the perspective of various evaluation criteria, e.g. the total execution time or energy consumption, we may observe completely different results. Therefore, as tasks can be scheduled and managed in many alternative ways on both many-core CPUs or GPUs and consequently have a huge impact on the overall computing resources performance, there are needs for new and improved resource management techniques. In this paper we discuss results achieved during experimental performance studies of selected task scheduling methods in heterogeneous computing systems. Additionally, we present a new architecture for resource allocation and task scheduling library which provides a generic application programming interface at the operating system level for improving scheduling polices taking into account a diversity of tasks and heterogeneous computing resources characteristics

Directory of Open Access Journals

Methods to Load Balance a GCR Pressure Solver Using a Stencil Framework on Multi- and Many-Core Architectures

Author: Krzysztof Kurowski
Michal Kulczewski
Milosz Ciznicki
Piotr Kopta
Publication venue: Hindawi Limited
Publication date: 01/01/2015
Field of study

The recent advent of novel multi- and many-core architectures forces application programmers to deal with hardware-specific implementation details and to be familiar with software optimisation techniques to benefit from new high-performance computing machines. Extra care must be taken for communication-intensive algorithms, which may be a bottleneck for forthcoming era of exascale computing. This paper aims to present a high-level stencil framework implemented for the EULerian or LAGrangian model (EULAG) that efficiently utilises multi- and many-cores architectures. Only an efficient usage of both many-core processors (CPUs) and graphics processing units (GPUs) with the flexible data decomposition method can lead to the maximum performance that scales the communication-intensive Generalized Conjugate Residual (GCR) elliptic solver with preconditioner

Directory of Open Access Journals

A novel 3- level energy heterogeneity clustering protocol with hybrid routing for a concentric circular wireless sensor network

Author: A. Chithra
G Smaragdakis
H González
K Romer
K Vivek
L Qing
M Saeidmanesh
Manju Bala
Milosz Ciznicki
R Saravanakumar
R. Shantha Selva Kumari
S Faisal
S Mo
S Sirsikar
SP Singh
Vrinda Gupta
WR Heinzelman
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref